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DETAILED ACTION 
Information Disclosure Statement 

1 . The information disclosure statement (IDS) submitted on 6/5/03 is considered by 
the examiner. 

Specification 

2. The disclosure is objected to because of the following informalities: on page 1 of 
the specification, the blank spaces of the "RELATED APPLICATIONS" section needs to 
be updated with correct application numbers and dates to provide accurate information. 

Appropriate correction is required. 

Claim Rejections - 35 USC § 102 

3. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in a patent granted on an application for patent by another filed in the 
United States before the invention thereof by the applicant for patent, or on an international application 
by another who has fulfilled the requirements of paragraphs (1). (2), and (4) of section 371 (c) of this 
title before the invention thereof by the applicant for patent. 

The changes made to 35 U.S.C. 102(e) by the American Inventors Protection Act 
of 1999 (AlPA) and the Intellectual Property and High Technology Technical 
Amendments Act of 2002 do not apply when the reference is a U.S. patent resulting 
directly or indirectly from an international application filed before November 29, 2000. 
Therefore, the prior art date of the reference is determined under 35 U.S.C. 102(e) prior 
to the amendment by the AlPA (pre-AlPA 35 U.S.C. 102(e)). 
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4. Claims 1 , 2, 4-14, 16-26 and 28-36 are rejected under 35 U.S.C. 102(e) as being 
anticipated by De Bonet (6,510,177). 

Regarding claim 1, De Bonet discloses a method of coding video (fig.2, element 
220), comprising the steps of: 

encoding an uncoded video to generate extended base layer reference frames 
(fig.2, element 220 is the layered video encoder that utilizes a base layer module 224 
and an enhancement layer module 228 to generate base layer reference frames), each 
of the extended base layer reference frames including a base layer reference frame 
and at least a portion of an associated enhancement layer reference frame (in fig.9, De 
Bonet discloses the "extended" or enhanced base layer reference frames include base 
layer frames, as inputted in element 905, and at least a portion of an associated 
enhancement layer reference frame, as inputted in element 925 where high-resolution 
motion vectors are the "portion of an associated enhancement layer reference frame", 
and also, in col. 13, ln.48-50, note base layer and high resolution motion vectors are 
used for prediction where the reference frames, I and P frames, are used to obtain B- 
frames, thus prediction requires reference frames); and 

generating frame residuals from the uncoded video and the extended base layer 
reference frames (fig.6, element 655 generates frame residuals from uncoded video 
and fig.9, element 945 is where the frame residuals are generated from the extended 
base layer reference frames). 

Regarding claim 2, De Bonet discloses a method of coding video according to 
claim 1 , further comprising the step of coding the frame residuals with a scalable codec 
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selected from the group consisting of DCT based codecs or wavelet based codecs to 
generate enhancement layer frames (col.13, ln.30-37; DCT based scalable coding or 
wavelet coding is applied to the frame residuals to generate enhancement layer 
frames, as seen in fig.9, the frame residuals are coded in element 955, and then, in 
element 960, enhancement layer frames are generated). 

Regarding claim 4, De Bonet discloses a method of coding video according to 
claim 1, wherein the frame residuals include B frame residuals (col.12, ln.41-44 and 
fig.6, element 655 calculates the B frame residuals). 

Regarding claim 5, De Bonet discloses a method of coding video according to 
claim 4, wherein the frame residual further include P frame residuals (col.12, ln.41-44 
and fig.6, element 655 calculates the P frame residuals). 

Regarding claim 6, De Bonet discloses a method of coding video according to 
claim 1, wherein the frame residual include P frame residuals (col.12, ln.41-44 and 
fig.6. element 655 calculates the P frame residuals). 

Regarding claim 7, De Bonet discloses a method of decoding a compressed 
video having a base layer stream and an enhancement layer stream (fig.2, element 
260 has a base layer decoder module 270 and an enhancement layer decoder module 
280), the method comprising the steps of: 

decoding the base layer and enhancement layer streams to generate extended 
base layer reference frames (fig.2, element 260 decodes the base layer, via element 
270, and the enhancement layer, via element 280, to generate "extended" or enhanced 
base layer reference frames for viewing an enhanced video output), each of the 
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extended base layer reference frames including a base layer reference frame and at 
least a portion of an associated enhancement layer reference frame (in fig.1 1 A, module 
1020, at element 1 124, the base layer reference frame data is received, and at element 
1 133, the "at least a portion of an associated enhancement layer reference frame" or 
residual P-frames are received, as disclosed in col. 17, lines 1-8, and then the base 
layer reference frame data and the "at least a portion of an associated enhancement 
layer reference frame" are summed together at element 1 142 to form high resolution P- 
frames); and 

predicting frame residuals from the extended base layer reference frames 
(col. 17, In. 1-1 4, in fig.1 1 A, frame residuals are decoded and predicted from the 
"extended" or enhanced base layer reference frames by utilization and application of 
the enhancement of the upsampled motion vectors using additional enhancement layer 
motion refinement data and the overlapped block motion compensation). 

Regarding claim 8, De Bonet discloses a method of decoding video according to 
claim 7, further comprising the step of decoding the frame residuals with scalable 
decoding selected from the group consisting of DCT based decoding or wavelet based 
decoding (col. 16, ln.52-55; clearly, DCT based decoding must be done for proper 
decoding since DCT based coding is used in the encoding stage, and frame residuals 
are decoded in elements 1 1 1 2 and 1 1 36 of fig. 1 1 A and element 1 1 63 of fig. 1 1 B). 

Regarding claim 9, De Bonet discloses a method of decoding video according to 
claim 8, further comprising the steps of: 
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generating enhancement layer frames from the frame residuals (fig. 10 is a 
general description of the enhancement layer decoder, where fig.1 1A and 1 1B show 
the specifics of generating enhancement layer frames from I, P and B frame residuals); 
and 

generating an enhanced video from the base layer frames and the enhancement 
layer frames (fig.2, note element 260 is the layered video decoder that generates an 
enhanced video from the utilization of the base layer decoder module 270 and the 
enhancement layer decoder module 280, where in col. 17, ln.44-50, fig. 1 1B, element 
1 184 is where the appending of the high resolution frames occur and preparation of the 
high resolution frames for viewing at element 1 187 is sent to a display like element 290 
of fig.2 or element 146 of fig.1). 

Regarding claim 10, De Bonet discloses a method of decoding video according 
to claim 7, wherein the frame residuals include B frame residuals (fig.1 1B, note 
"Residual B-frames" is decoded in element 1163). 

Regarding claim 1 1 . De Bonet discloses a method of decoding video according 
to claim 10, wherein the frame residuals further include P frame residuals (fig. 11 A, note 
"Residual P-frames" is decoded in element 1 136). 

Regarding claim 12, De Bonet discloses a method of decoding video according 
to claim 7, wherein the frame residuals include P-frame residuals (fig.1 1A, note 
"Residual P-frames" is decoded in element 1 136). 

Regarding claim 13, De Bonet discloses a memory medium for coding video 
(col.5, ln.30-48; fig.1 is a computer system configuration, including system memory, 
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harddrive, magnetic disk drive, optical disc drive, keyboard input, processor (CPU), to 
execute instructions, subroutines or software code for coding video, where fig.2, 220 is 
a layered video coder), the memory medium comprising: 

code for encoding an uncoded video to generate extended base layer reference 
frames (fig.2, element 220 is the layered video encoder that utilizes a base layer 
module 224 and an enhancement layer module 228 to generate base layer reference 
frames), each of the extended base layer reference frames including a base layer 
reference frame and at least a portion of an associated enhancement layer reference 
frame (in fig.9, De Bonet discloses the "extended" or enhanced base layer reference 
frames include base layer frames, as inputted in element 905, and at least a portion of 
an associated enhancement layer reference frame, as inputted in element 925 where 
high-resolution motion vectors are the "portion of an associated enhancement layer 
reference frame" , and also, in col.13, ln.48-50, note base layer and high resolution 
motion vectors are used for prediction where the reference frames, I and P frames, are 
used to obtain B-frames, thus prediction requires reference frames); and 

code for predicting frame residuals from the uncoded video and the extended 
base layer reference frames (fig.6, element 655 generates frame residuals from 
uncoded video and fig.9, element 945 is where the frame residuals are generated from 
the extended base layer reference frames). 

Regarding claim 14, De Bonet discloses a memory medium for coding video 
according to claim 13, further comprising code for scalable encoding the frame 
residuals (col.13, ln.30-37; DCT based scalable coding is applied to the frame 
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residuals to generate enliancement layer franfies, as seen in fig.9, the frame residuals 
are coded in element 955, and then, in element 960, enhancement layer frames are 
generated). 

Regarding claim 16, De Bonet discloses a memory medium for coding video 
according to claim 13, wherein the frame residuals include B frame residuals (col. 12, 
ln.41-44 and fig.6, element 655 calculates the B frame residuals). 

Regarding claim 17, De Bonet discloses a memory medium for coding video 
according to claim 16, where in the frame residuals further include P frame residuals 
(col. 12, ln.41-44 and fjg.6, element 655 calculates the P frame residuals). 

Regarding claim 18, De Bonet discloses a memory medium for coding video 
according to claim 13, wherein the frame residuals include P frame residuals (col. 12, 
ln.41-44 and fig.6, element 655 calculates the P frame residuals). 

Regarding claim 19, De Bonet discloses a memory medium for decoding a 
compressed video having a base layer stream and an enhancement layer stream 
(col.5, ln.30-48; fig.1 is a computer system configuration, including system memory, 
harddrive, magnetic disk drive, optical disc drive, keyboard input, processor (CPU), to 
execute instructions, subroutines or software code for decoding compressed video, 
where fig.2, 260 is a layered video decoder having a base layer decoder module 270 
and an enhancement layer decoder module 280), the memory medium comprising: 

code for decoding the base layer and enhancement layer streams to generate 
extended base layer reference frames (fig.2, element 260 decodes the base layer, via 
element 270, and the enhancement layer, via element 280, to generate "extended" or 




Application/Control Number: 09/930,672 Page 9 

Art Unit: 2613 

enhanced base layer reference frames for viewing an enhanced video output), each of 
the extended base layer reference frames including a base layer reference frame and 
at least a portion of an associated enhancement layer reference frame (in fig.1 1A, 
module 1020, at element 1 124, the base layer reference frame data is received, and at 
element 1 133, the "at least a portion of an associated enhancement layer reference 
frame" or residual P-frames are received, as disclosed in col. 17, lines 1-8, and then the 
base layer reference frame data and the "at least a portion of an associated 
enhancement layer reference frame" are summed together at element 1 142 to form 
high resolution P-frames); and 

code for predicting frame residuals from the extended base layer reference 
frames (col.17, ln.1-14, in fig.1 1A, frame residuals are decoded and predicted from the 
"extended" or enhanced base layer reference frames by utilization and application of 
the enhancement of the upsampled motion vectors using additional enhancement layer 
motion refinement data and the overlapped block motion compensation). 

Regarding claim 20, De Bonet discloses a memory medium for decoding a 
compressed video according to claim 19, further comprising code for scalable decoding 
the frame residuals, the code for scalable decoding selected from the group consisting 
of DCT based code or wavelet based code (col.16, ln.52-55; clearly, DCT based 
decoding must be done for proper decoding since DCT based coding is used in the 
encoding stage, and frame residuals are decoded in elements 1112 and 1 136 of 
fig. 11 A and element 1163 of fig.1 IB). 
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Regarding claim 21 , De Bonet discloses a memory medium for decoding a 
compressed video according to claim 20, further comprising: 

code for generating enhancement layer frames from the frame residuals (fig. 10 is 
a general description of the enhancement layer decoder, where fig.1 1 A and 1 1 B show 
the specifics of generating enhancement layer frames from I, P and B frame residuals); 
and 

code for generating an enhanced video from the base layer frames and the 
enhancement layer frames (fig.2, note element 260 is the layered video decoder that 
generates an enhanced video from the utilization of the base layer decoder module 
270 and the enhancement layer decoder module 280, where in col. 17, ln.44-50, 
fig.1 1B, element 1 184 is where the appending of the high resolution frames occur and 
preparation of the high resolution frames for viewing at element 1 187 is sent to a 
display like element 290 of fig.2 or element 146 of fig.1). 

Regarding claim 22, De Bonet discloses a memory medium for decoding a 
compressed video according to claim 19, wherein the frame residuals include B frame 
residuals (fig.1 IB, note "Residual B-frames" is decoded in element 1163). 

Regarding claim 23, De Bonet discloses a memory medium for decoding a 
compressed video according to claim 22, wherein the frame residuals further include P 
frame residuals (fig.1 1A, note "Residual P-frames" is decoded in element 1136). 

Regarding claim 24, De Bonet discloses a memory medium for decoding a 
compressed video according to claim 19, wherein the frame residuals include P frame 
residuals (fig.l 1A, note "Residual P-frames" is decoded in element 1 1 36). 
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Regarding claim 25, De Bonet discloses an apparatus for coding video (fig.2, 
element 220), the apparatus comprising: 

means for encoding an uncoded video to generate extended base layer 
reference frames (fig.2, element 220 is the layered video encoder that utilizes a base 
layer module 224 and an enhancement layer module 228 to generate base layer 
reference frames), each of the extended base layer reference frames including a base 
layer reference frame and at least a portion of an associated enhancement layer 
reference frame (in fig.9, De Bonet discloses the "extended" or enhanced base layer 
reference frames include base layer frames, as inputted in element 905, and at least a 
portion of an associated enhancement layer reference frame, as inputted in element 
925 where high-resolution motion vectors are the "portion of an associated 
enhancement layer reference frame" , and also, in col. 13, ln.48-50, note base layer and 
high resolution motion vectors are used for prediction where the reference frames, I 
and P frames, are used to obtain B-frames, thus prediction requires reference frames); 
and 

means for predicting frame residuals from the uncoded video and the extended 
base layer reference frames (fig.6, element 655 generates frame residuals from 
uncoded video and fig.9, element 945 is where the frame residuals are generated from 
the extended base layer reference frames). 

Regarding claim 26, De Bonet discloses an apparatus for coding video according 
to claim 25, further comprising means for scalable encoding the frame residuals 
(col. 13, ln.30-37; DCT based scalable coding is applied to the frame residuals to 
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generate enhancement layer frames, as seen in fig.9, the frame residuals are coded in 
element 955, and then, in element 960, enhancement layer frames are generated). 

Regarding claim 28, De Bonet discloses an apparatus for coding video according 
to claim 25, wherein the frame residuals include B frame residuals (col. 12, ln.41-44 and 
fig.6, element 655 calculates the B frame residuals). 

Regarding claim 29, De Bonet discloses an apparatus for coding video according 
to claim 28, wherein the frame residuals further include P frame residuals (col. 12, 
ln.41-44 and fig.6, element 655 calculates the P frame residuals). 

Regarding claim 30, De Bonet discloses an apparatus for coding video according 
to claim 25, wherein the frame residuals include P frame residuals (col. 12, ln.41-44 and 
fig.6, element 655 calculates the P frame residuals). 

Regarding claim 31 , De Bonet discloses an apparatus for decoding a 
compressed video having a base layer stream and an enhancement layer stream 
(fig.2, element 260 is a layered video decoder having a base layer decoder module 270 
and an enhancement layer decoder module 280), the apparatus comprising: 

means for decoding the base layer and enhancement layer streams to generate 
extended base layer reference frames (fig.2, element 260 decodes the base layer, via 
element 270, and the enhancement layer, via element 280, to generate "extended" or 
enhanced base layer reference frames for viewing an enhanced video output), each of 
the extended base layer reference frames including a base layer reference frame and 
at least a portion of an associated enhancement layer reference frame (in fig.1 1 A, 
module 1020, at element 1 124, the base layer reference frame data is received, and at 
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element 11 33, the "at least a portion of an associated enhancement layer reference 
frame" or residual P-frames are received, as disclosed in col. 17, lines 1-8, and then the 
base layer reference frame data and the "at least a portion of an associated 
enhancement layer reference frame" are summed together at element 1 142 to form 
high resolution P-frames); and 

means for predicting frame residuals from the extended base layer reference 
frames (col. 17, In. 1-14, in fig.llA, frame residuals are decoded and predicted from the 
"extended" or enhanced base layer reference frames by utilization and application of 
the enhancement of the upsampled motion vectors using additional enhancement layer 
motion refinement data and the overlapped block motion compensation). 

Regarding claim 32, De Bonet discloses an apparatus for decoding a 
compressed video according to claim 31 , further comprising scalable decoding means 
for decoding the frame residuals, the scalable decoding means selected from the group 
consisting of DCT based decoding means or wavelet based decoding means (col. 16, 
ln.52-55; clearly, DCT based decoding must be done for proper decoding since DCT 
based coding is used in the encoding stage, and frame residuals are decoded in 
elements 1 1 1 2 and 1 1 36 of fig. 1 1 A and element 11 63 of fig. 1 1 B). 

Regarding claim 33, De Bonet discloses an apparatus for decoding a 
compressed video according to claim 32, further comprising: 

means for generating enhancement layer frames from the frame residuals (fig. 10 
is a general description of the enhancement layer decoder, where fig.1 1A and 1 1 B 
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show the specifics of generating enhancement layer frames from I, P and B frame 
residuals); and 

means for generating an enhanced video from the base layer frames and the 
enhancement layer frames (fig.2, note element 260 is the layered video decoder that 
generates an enhanced video from the utilization of the base layer decoder module 
270 and the enhancement layer decoder module 280, where in col. 17, ln.44-50, 
fig.1 1 B, element 1 184 is where the appending of the high resolution frames occur and 
preparation of the high resolution frames for viewing at element 1 187 is sent to a 
display like element 290 of fig.2 or element 146 of fig.1 ). 

Regarding claim 34, De Bonet discloses an apparatus for decoding a 
compressed video according to claim 31 , wherein the frame residuals include B frame 
residuals (fig.1 IB, note "Residual B-frames" is decoded in element 1163). 

Regarding claim 35, De Bonet discloses an apparatus for decoding a 
compressed video according to claim 34, wherein the frame residuals further include P 
frame residuals (fig.1 1A, note "Residual P-frames" is decoded in element 1136). 

Regarding claim 36, De Bonet discloses an apparatus for decoding a 

compressed video according to claim 31 , wherein the frame residuals include P frame 

residuals (fig. 11 A, note "Residual P-frames" is decoded in element 1136). 

Claim Rejections - 35 USC § 103 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
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invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



6. Claims 3, 15 and 27 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over De Bonet (6,510,177) in view of Wu (6,614,936). 

Regarding claims 3, 1 5 and 27, De Bonet discloses further the step of coding the 
frame residuals (fig.6, element 660). De Bonet does not specifically disclose the step of 
coding the frame residuals with a fine granular scalable codec to generate fine granular 
scalable enhancement layer frames. However, Wu teaches the use of progressive fine 
granular scalable codec to generate fine granular scalable enhancement layer frames 
(col. 5, ln.23-33). Therefore, it would have been obvious to one of ordinary skill in the art 
to combine Wu's teaching of progressive fine-granularity scalable encoding scheme 
with De Bonet's video encoding system for yield high encoding efficiency and good error 
recovery, especially during transmission over the Internet and wireless channels (col.6, 
ln.66 to col.7, ln.2). 



7. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Puri et al. (US 6,148,026) disclose mesh node coding to enable object based 
functionalities within a motion compensated transform video coder. 



Conclusion 
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Contact Information 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Allen Wong whose telephone number is (703) 306-5978. 
The examiner can normally be reached on Mondays to Thursdays from 8am-6pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Christopher Kelley can be reached on (703) 305-4856. The fax phone 
number for the organization where this application or proceeding is assigned is 703- 
872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status infonnation for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). ^ ^ 
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